PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG005689t1
Common NameTCM_005689
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 296aa    MW: 32669.6 Da    PI: 8.588
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG005689t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox55.78.1e-18130184256
                       T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       rk+ +++keq   Lee F+++++++ +++  LAk+l+L  rqV vWFqNrRa+ k
  Thecc1EG005689t1 130 RKKLRLSKEQSLLLEETFKEHSTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTK 184
                       788899***********************************************98 PP

2HD-ZIP_I/II125.42.5e-40130219191
       HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreel 91 
                       +kk+rlskeq+ lLEe+F+e+++L+p++K +la++L+l+prqv+vWFqnrRARtk+kq+E+d+e+Lkr++++l+een+rL+kev+eLr +l
  Thecc1EG005689t1 130 RKKLRLSKEQSLLLEETFKEHSTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRCCENLTEENRRLQKEVQELR-AL 219
                       69*************************************************************************************9.55 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF046181.8E-364100IPR006712HD-ZIP protein, N-terminal
Gene3DG3DSA:1.10.10.603.7E-18104187IPR009057Homeodomain-like
SuperFamilySSF466898.13E-19117187IPR009057Homeodomain-like
PROSITE profilePS5007117.281126186IPR001356Homeobox domain
SMARTSM003896.2E-15128190IPR001356Homeobox domain
CDDcd000861.27E-14130187No hitNo description
PfamPF000463.0E-15130184IPR001356Homeobox domain
PROSITE patternPS000270161184IPR017970Homeobox, conserved site
SMARTSM003405.7E-27186229IPR003106Leucine zipper, homeobox-associated
PfamPF021832.4E-11186220IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 296 aa     Download sequence    Send to blast
MGAEKDDGLG LSLSLGCAQN HPSLKLNLMP LASPRMQNLQ QKNTWNELFQ SSDRNLDTRS  60
FLRGIDVNRA PATVDCEEEG GVSSPNSTIS SISGKRNERD PVGDETEAER ASCSRASDDE  120
DGGAGGDASR KKLRLSKEQS LLLEETFKEH STLNPKQKLA LAKQLNLRPR QVEVWFQNRR  180
ARTKLKQTEV DCEYLKRCCE NLTEENRRLQ KEVQELRALK LSPQLYMHMN PPTTLTMCPS  240
CERVAVSSSS SSAAATASST PTSTVPNRHH RTSSVSPWAA MPIGHRPFHA PASRS*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1128134SRKKLRL
2178186RRARTKLKQ
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007052281.10.0Homeobox-leucine zipper protein 4 / HD-ZIP protein
SwissprotP466021e-119HAT3_ARATH; Homeobox-leucine zipper protein HAT3
TrEMBLA0A061E2C40.0A0A061E2C4_THECC; Homeobox-leucine zipper protein 4 / HD-ZIP protein
STRINGPOPTR_0014s04460.11e-144(Populus trichocarpa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM25772772
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G60390.13e-86homeobox-leucine zipper protein 3
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]